Goto

Collaborating Authors

 episodic memory-based lifelong learning


Review for NeurIPS paper: Improved Schemes for Episodic Memory-based Lifelong Learning

Neural Information Processing Systems

There has been a plethora of recent and historical work on this topic, finding different ways to help networks alleviate the issue of catastrophic forgetting --- where a network trained on tasks A_0 through A_i, forgets these to differing degrees when trained on tasks A_i 1 onward. Most methods can be divided into regularisation based, memory based or meta-learning based. One relatively recent work is GEM (gradient of episodic memory) (and relatedly A-GEM). This works by storing examples from seen tasks in an episodic memory. When learning a new task, the gradient update is modified such that it does not increase the loss on examples from previous tasks (these are represented by the examples in memory).


Review for NeurIPS paper: Improved Schemes for Episodic Memory-based Lifelong Learning

Neural Information Processing Systems

The paper introduces a clear, simple generalisation of two established continual learning methods (GEM and A-GEM) which performs very well in a thorough empirical evaluation. All reviewers and the AC value the effort that the authors put in their response. There is consensus that the work has merit and all reviewers recommend accepting the paper (R1 and R4 raised their score).


Improved Schemes for Episodic Memory-based Lifelong Learning

Neural Information Processing Systems

Current deep neural networks can achieve remarkable performance on a single task. However, when the deep neural network is continually trained on a sequence of tasks, it seems to gradually forget the previous learned knowledge. This phenomenon is referred to as catastrophic forgetting and motivates the field called lifelong learning. Recently, episodic memory based approaches such as GEM and A-GEM have shown remarkable performance. In this paper, we provide the first unified view of episodic memory based approaches from an optimization's perspective.